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DETAILED ACTION 
Application Status 

1. Claims 1-3, 7-9, 11, and 21-22 are pending in the application. 

2. Applicant's amendment to the claims, filed on 17 July 2006, is acknowledged. 
This listing of the claims replaces all prior versions and listings of the claims. 

3. Applicant's amendment to the specification, filed on 17 July 2006, is 
acknowledged. In view of this statement, sequence compliance appears to be 
perfected. 

4. Applicant's arguments filed on 17 July 2006 in response to the Office action 
mailed on 7 March 2006 have been fully considered and are deemed to be persuasive 
to overcome at least one of the rejections and/or objections previously applied. 
Rejections and/or objections not reiterated from previous office actions are hereby 
withdrawn. 

5. The text of those sections of Title 35, U.S. Code not included in the instant action 
can be found in a prior Office action. 

Claim Rejections - 35 USC § 112, Second Paragraph 

6. Claims 9 and 21-22 are rejected under 35 U.S.C. 1 12, second paragraph, as 
being indefinite for failing to particularly point out and distinctly claim the subject matter 
which applicant regards as the invention. 

Claims 9 and 21 are confusing in the recitation of "[a]n isolated polypeptide 
defined by a variant... the variant consists of a single amino acid substitution..." 
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According to MPEP 2111 .03, "[t]he transitional phrase 'consisting of excludes any 
element, step, or ingredient not specified in the claim." Thus, claims 9 and 21 would 
appear to read on a polypeptide variant consisting of a single amino acid as defined by 
the claims. Claim 22 is also rejected as being confusing in the use of the transitional 
phrase "consists of in the recitation of "[a]n isolated polypeptide defined by a variant" of 
SEQ ID NO:5 "wherein the variant consists of a substitution of the amino acids at 
positions 255-258." In the interest of advancing prosecution, claims 9 and 21 have been 
interpreted as meaning a variant of SEQ ID NO:3, 5, or 6 with a single amino acid 
mutation, wherein the mutation is at position 73 or 81 . Claim 22 has been interpreted as 
meaning the polypeptide of SEQ ID NO:5, wherein amino acids 255-258 of SEQ ID 
NO:5 are replaced with SEQ ID NO:7, 8, 9, 10, 11, 12, 13, or 14. It is suggested that 
applicant clarify the meaning of the claims. 

Claim Rejections - 35 USC §112, First Paragraph 

7. Claim 22 is rejected under 35 U.S.C. § 1 12, first paragraph, written description, 
as containing subject matter which was not described in the specification in such a way 
as to reasonably convey to one skilled in the relevant art that the inventor(s), at the time 
the application was filed, had possession of the claimed invention. This is a new matter 
rejection. 

New claim 22 is drawn to a polypeptide "defined by a variant of... SEQ ID NO:5, 
wherein the variant consists of a substitution... at positions 255-258 with SEQ ID 
NO:7,8,9,10,1 1,12,13, or 14. MPEP § 2163 states, "when filing an amendment an 



Application/Control Number: 09/825,423 Page 4 

Art Unit: 1656 

applicant should show support in the original disclosure for new or amended claims" 
and "[i]f the originally filed disclosure does not provide support for each claim limitation, 
or if an element which applicant describes as essential or critical is not claimed, a new 
or amended claim must be rejected under 35 U.S.C. 112, para. 1 , as lacking adequate 
written description." Applicant points to original claim 7, p. 11 of the specification, and 
"by comparing the full-length sequence in SEQ ID NO:1 with the subdomain I, II 
fragment sequence in SEQ ID NO:5" (instant response at p. 5, bottom). The examiner 
has reviewed applicant's cited supporting disclosure and has aligned SEQ ID NO:1 
against SEQ ID NO:5 (see Appendix C). However, this disclosure does not appear to 
support the claimed variant polypeptide. It is suggested that applicant show support for 
new claim 22. If applicant maintains that the cited disclosure supports claim 22 as 
written, applicant is requested to provide a detailed explanation as to how this cited 
disclosure supports the polypeptide of claim 22. 

8. The written description rejection of claim 1 1 under 35 U.S.C. § 1 12, first 
paragraph, is maintained for the reasons of record and the reasons stated below. The 
rejection was fully explained in the prior Office action. 

RESPONSE TO ARGUMENT: Applicant argues the rejection is overcome by 
claim amendment to "define the polypeptide in the crystalline composition by both a 
specific amino acid sequence and a specific set of structural coordinates." 

Applicant's argument is not found persuasive. The examiner maintains the 
position that the specification fails to describe the genus of crystalline compositions of 
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claim 1 1 . While the amendment to the claims limits the polypeptide of the composition 
as the recitation of "crystalline composition" in claim 1 1 does not specifically define any 
of the crystalline compositions that fall within its definition, particularly as the recitation 
of "crystalline composition" does not define any structural features commonly 
possessed by members of the genus of crystalline compositions of SEQ ID NO:17 that 
distinguish them from others. One skilled in the art therefore cannot, as one can do with 
a fully described genus, visualize or recognize the identity of the members of the genus 
of proteins in crystalline form. In this case, the structure(s) of the genus of crystals of the 
protein of SEQ ID NO: 17 is completely undefined. 

Applicant appears to take the position that by virtue of limiting the polypeptide of 
the crystalline composition to SEQ ID NO:17 having the structural coordinates of Table 
5, the genus of crystals is adequately described, however, it is well-known in the art that 
a single polypeptide can crystallize into a plurality of distinct crystal forms, which one 
cannot predict a priori (see, e.g., Aleshin et al. FEBS Lett 434:42-46, 1998). Thus, as 
noted in the prior Office action, the genus of crystals encompasses species that are 
widely variant, encompassing crystals of unliganded and liganded forms of SEQ ID 
NO: 17, wherein the liganded form is in complex with any ligand(s). In this case, the 
specification discloses only a single representative species of the genus of recited 
crystalline compositions, i.e., a protein crystal of SEQ ID NO:17 having space group P2<\ 
and unit cell dimensions a=34.8 A, b=67.1 A, c=58.4 A, a=K=90°, and 0=101.3° (see 
particularly pp. 41-42 of the specification, which teaches crystallization of SEQ ID 
NO:17, which is amino acids 181-324 of the HCV NS3 helicase of SEQ ID NO:1), which 
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is undisputed by applicant. Other than these single species, the specification fails to 
describe any other crystals of SEQ ID NO:17 as encompassed by the claims. MPEP § 
2163 states "[f]or inventions in an unpredictable art, adequate written description of a 
genus which embraces widely variant species cannot be achieved by disclosing only 
one species within the genus." As such, the single disclosed species of crystals of SEQ 
ID NO: 17 fails to describe all crystals as encompassed by the claim. 

Given the lack of description of a representative number of protein crystals, the 
specification fails to sufficiently describe the claimed invention in such full, clear, 
concise, and exact terms that a skilled artisan would recognize that applicant was in 
possession of the claimed invention. 

9. The scope of enablement rejection of claim 1 1 under 35 U.S.C. 112, first 
paragraph, is maintained for the reasons of record and the reasons stated below. The 
rejection was fully explained in the prior Office action. 

RESPONSE TO ARGUMENT: Applicant argues the rejection is overcome by 
claim amendment to "define the polypeptide in the crystalline composition by both a 
specific amino acid sequence and a specific set of structural coordinates." 

Applicant's argument is not found persuasive. The examiner maintains the 
position that the specification fails to enable all crystals as broadly encompassed by the 
claim. While the examiner acknowledges the amendment to limit the polypeptide of the 
crystal to SEQ ID NO:17 having the structural coordinates of Table 5, claim 1 1 
nonetheless broadly encompasses all crystals of SEQ ID NO: 17, unliganded or 
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complexed with any ligand, having any space group, and any unit cell dimensions. The 
specification discloses only a single working example of the claimed crystal, i.e., a 
crystal of SEQ ID NO:17 having space group P2<\ and unit cell dimensions a=34.8 A, 
b=67.1 A, c=58.4 A, gt=k=90°, and £=101.3° (see particularly pp. 41-42 of the 
specification, which teaches crystallization of SEQ ID NO:17, which is amino acids 181- 
324 of the HCV NS3 helicase of SEQ ID NO:1). The specification fails to disclose any 
other working examples or guidance for making other protein crystals of SEQ ID NO:17 
under any other conditions with an expectation of obtaining diffraction-quality crystals. 
As noted in the prior Office action - and undisputed by application - the state of the art 
at the time of the invention acknowledges a high level of unpredictability for making a 
protein crystal. For example, the reference of Branden et al. ("Introduction to Protein 
Structure Second Edition", Garland Publishing Inc., New York, 1999; cited in the prior 
Office action) teaches that "[c]rystallization is usually quite difficult to achieve" (p. 375) 
and that "[w]ell-ordered crystals... are difficult to grow because globular protein 
molecules are large, spherical, or ellipsoidal objects with irregular surfaces, and it is 
impossible to pack them into a crystal without forming large holes or channels between 
the individual molecules" (p. 374). Also, Drenth et al. ("Principles of X-ray 
Crystallography," Springer, New York, 1995; cited in the prior Office action) teaches that 
"[t]he science of protein crystallization is an underdeveloped area" and "[p]rotein 
crystallization is mainly a trial-and-error procedure" (p. 1). One cannot predict a priori 
those conditions that will lead to the successful crystallization of a diffraction-quality 
crystal nor can one predict the space group symmetry or unit cell dimensions of the 
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resulting crystal. As stated above, even a single polypeptide can have multiple crystal 
forms, however, what form will result from which particular crystallization conditions - if 
any - remains highly unpredictable as evidenced by the state of the art at the time of 
the invention. While applicant may argue that a crystal of SEQ ID NO: 17 in complex 
with a ligand or ligands can be prepared according to the disclosed method and would 
have the same space group and unit cell dimensions, there is no way to predict a priori 
the space group and unit cell dimensions of a protein, as evidenced by the references of 
Kierzek et al. (cited in the prior Office action; see cited relevant teachings). While 
methods of protein crystallography were known at the time of the invention, it was not 
routine in the art to make all polypeptide crystals as encompassed by the claims and 
screen for those that are diffraction-quality under any crystallization conditions as 
encompassed by the claims, diffract those crystals, and to determine those polypeptide 
crystal structures that represent biologically-relevant macromolecules. 

In view of the overly broad scope of the claims, the lack of guidance and working 
examples provided in the specification, the high level of unpredictability as evidenced by 
the prior art, and the amount of experimentation required to make and use all crystals 
and make and use all three-dimensional structures and methods of "rational drug 
design" as broadly encompassed by the claims, undue experimentation would be 
necessary for a skilled artisan to make and use the entire scope of the claimed 
invention. 



Conclusion 
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1 0. Status of the claims: 

Claims 1 -3, 7-9, 1 1 , and 21-22 are pending. 

Claims 1-3 and 7-8 appear to be in a condition for allowance. 

Claims 9, 11, and 21-22 are rejected. 

Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J. Steadman whose telephone number is 571- 
272-0942. The examiner can normally be reached on Mon to Fri, 7:30 am to 4:00 pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Kathleen Kerr can be reached on 571-272-0931. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). ^ „ 



David JT Steadman, Ph.D. 
Primary Examiner 
Art Unit 1656 
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APPENDIX A 



CLUSTAL W (1.83) Multiple Sequence Alignments 
Sequence type explicitly set to Protein 



Sequence format 


is Pearson 




Sequence 1 


: SEQ_ 


_ID_N0_3 


435 aa 


Sequence 2 


: SEQ] 


~ID_N0_5 


845 aa 


Sequence 3 


: SEQ] 


~ID_N0_6 


708 aaSeq 


SEQ_ID NO 


17 


423 aa 






Start of Pairwise alignments 




Sequences 


(1:2) 


Al igned . 


Score : 


100 


Sequences 


(1:3) 


Aligned. 


Score : 


100 


Sequences 


(1:4) 


Aligned. 


Score : 


100 


Sequences 


(2:2) 


Aligned. 


Score : 


100 


Sequences 


(2:3) 


Aligned. 


Score : 


69.9153 


Sequences 


(2:4) 


Aligned. 


Score : 


100 


Sequences 


(3:2) 


Al igned . 


Score : 


69.9153 


Sequences 


(3:3) 


Aligned. 


Score : 


100 


Sequences 


(3:4) 


Al igned . 


Score : 


100 


Sequences 


(4:2) 


Aligned . 


Score : 


100 


Sequences 


(4:3) 


Aligned. 


Score : 


100 


Sequences 


(4:4) 


Aligned. 


Score : 


100 



Start of Multiple Alignment 



SEQ_ 


_ID_ 


_NO_ 


_6 


SEQ_ 


"id" 


"no" 




SEQ_ 


~Id] 


]no~ 


"3 


seq" 


"id" 


"no" 


"5 



GLYSERHISMETSERPRVALPHETHRASPASNSERSERPRPRALAVALPRGLNSERPHEG 

SERPRVALPHETHRASPASNSERSERPRPRALAVALPRGLNSERPHEG 

GLYSERHISMETSERPRVALPHETHRASPASNSERSERPRPRALAVALPRGLNSERPHEG 

_ _ _ GLYSERHISMETSERPRVALPHETHRASPASNSERSERPRPRALAVALPRGLNSERPHEG 

it*********************************************** 

SEQ_ID_N0_6 LNVALALAH I S LEUHI SALAPRTHRGLYS ERGLYLYS S ERTHRLYS VALPRALAALAT YR 

SEQ_ID_N0_17 LNVALALAH I S LEUH I S ALA PRTHRGL YS E RGL YL YS S ERTHRLYS VAL PRALAALATYR 

LNVALALAHI S LEUH I SALAPRTHRGLYS ERGLYLYS SERTHRLYS VALPRALAALAT YR 
LNVALALAHISLEUHISALAPRTHRGLYSERGLYLYSSERTHRLYSVALPRALAALATYR 

ALAALAGLNGLYTYRLYSVALLEUVALLEUASNPRSERVALALAALATHRLEUGLYPHEG 

ALAALAGLNGLYTYRLYSVALLEUVALLEUASNPRSERVALALAALATHRLEUGLYPHEG 

ALAALAGLNGLYTYRLYSVALLEUVALLEUASNPRSERVALALAALATHRLEUGLYPHEG 

ALAAIAGLNGLYTYRLYSVALLEUVALLEUASNPRSERVALALAALATHRLEUGLYPHEG 
************************************************************ 

LYALATYRMETS ERLYS ALAH I SGLYVALAS PPRASNI LEARGTHRGLYVALARGTHRI L 

L YALAT YRMET S ERLYS ALAH I S GLYVALAS PPRASNI LEARGTHRGLYVALARGTHR I L 

LYALATYRMETS ERLYS ALAH I SGLYVALASPPRASN I LEARGTHRGLYVALARGTHR I L 

LYAIJVT YRMET S ERLYS ALAH I SGLYVALAS PPRASNI LEARGTHRGLYVALARGTHR I L 
************************************************************ 

E THRTHRGL YS ER PR I LE THRTYRS ERTHRT YRGL YLYS PHE LEUALAAS PGL YG L YC YS 
ETHRTHRGLYSERPRILETHRTYRSERTHRTYRGLYLYSPHELEUALAASPGLYGLYCYS 
ETHRTHRGLYSERPRILETHRTYRSERTHRTYRGLYLYSPHELEUALAASPGLYGLYCYS 
ETHRTHRG L YS E R PR I LETHRT YRS ERTHRT YRGL YLYS PHE LEUALAAS PGL YG L YC YS 
************************************************************ 

S ERGLYGLYALAT YRAS P I LE I LE I LECYS AS PGLUCYSH I S S ERTHRAS PALATHRS ER 

S ERGLYGLYALATYRAS P I LE ILE I LECYS AS PGLUCYSHI SS ERTHRASPALATHRS ER 

S E RG LYGL YALAT YRAS P I LE I LE I LECYS AS PGLUC YS H I S S ERTHRAS PALATHRS ER 

SERGLYGLYALATYRAS PILE ILE I LECYSAS PGLUCYSHI SSERTHRAS PALATHRS ER 
************************************************************ 

I LELEUGLYI LEGLYTHRVALLEUAS PGLNALAGLUTHRALAG L YALAARGLEUVAL VAL 
I LELEUGLYI LEGLYTHRVALLEUAS PGLNALAGLUTHRALAGLYALAARGLEUVALVAL 
I LELEUGLYI LEGLYTHRVALLEUAS PGLNALAGLUTHRALAGLYALAARGLEUVALVAL 
I LELEUGLYI LEGLYTHRVALLEUAS PGLNALAGLUTHRALAGLYALAARGLEUVALVAL 



SEQ_ 


_ID_ 


_NO_ 


_3 


SEQ_ 


! ID ! 


"no] 


]5 


SEQ_ 


_ID_ 


_NO_ 


6 


SEQ 


"id] 




]l7 


SEQ_ 




"no" 


]3 


SEQ_ 


"id] 


_NO_ 


]s 


SEQ_ 


id_ 


_NO_ 


6 


SEQ_ 


]id] 


]no~ 


]l7 


SEQ_ 


~ID_ 


]no~ 


]3 


SEQ~ 


id] 


]no] 


]5 . 


SEQ_ 


ID_ 


_NO_ 


_6 


SEQ~ 


]id] 


]no~ 


]l7 


SEQ_ 


[id] 


]no_ 


]3 


SEQ_ 


"id] 


]no_ 


]5 


SEQ_ 


_id_ 


_NO_ 


6 


SEQ_ 


id] 


]no" 


]l7 


SEQ_ 


id] 


]no~ 


]3 


SEQ~ 


id" 


]no~ 


]s 


SEQ 


ID_ 


_NO_ 


6 


SEQ_ 


id] 


]no~ 


17 


SEQ 


id" 


]no~ 


3 


SEQ 


id" 


]no~ 


5 
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S EQ_I D_NO_6 LEUALATHRALATHRPRPRGLYSERGLYMET PHEAS PS ERSERVALLEU 

S EQ_ID_NO_l 7 LEUALATHRALATHR - 

S EQ_I D_NO_3 LEUALATHRALATHR - 

SEQ_ID_NO_5 LEUALATHRALATHRPRPRGLYSERVALTHRVALPRHISPRASNILEGLUGLUVALALAL 

*************** 

SEQ_ID_NO_6 CYSGLU- -CYSTYRASPALAGLYCYSALATRPTYRG 

SEQ_ID_NO_17 

SEQ_ID_NO_3 

SEQ ID NO 5 EUSERTHRTHRGLYGLUILEPRPHETYRGLYLYSALAILEPRLEUGLUVALILELYSGLY 



SEQ_ 


ID_ 


_NO_ 


6 


SEQ_ 


id" 


"no" 


"l7 


SEQ_ 


id" 


"no" 


3 


SEQ_ 


id" 


~no_ 


~5 


SEQ_ 


ID_ 


_no_ 


6 


SEQ_ 


id] 


"no" 


11 


SEQ_ 


'id" 


"no" 


3 


SEQ_ 


ID~ 


"no" 


^5 



LU LEUTHRPRALAGLUTHRTHR 



GLYARGHISLEUILEPHECYSHISSERLYSLYSLYSCYSASPGLULEUALAALALYSLEU 



VALARGLEU ARGALATYRMETASNTHRPRGLYLEU PRV 



VALALALEUGLYI LEASNALAVALALATYRTYRARGGLYLEUAS PVALS ERVAL I LE PRT 



SEQ_ID_NO_6 ALCYSGLNAS PHIS LEU GLUPHETRPGLU GLYVALPHETHRGLYLEU- 

SEQ_ID_NO_l 7 

SEQ_ID_NO__3 

SEQ ID NO 5 HRASNGLYAS PVAL VALVAL VALALATHRAS PALALEUMETTHRGLYPHETHRGLYAS PP 



SEQ_ID_NO_6 THRH I S I LEAS PALAH I S PHELEU - SERGLNTH 

SEQ_ID_NO_17 

SEQ_ID_NO_3 

SEQ ID NO 5 HEASPSERVALI LEAS PCYSASNTHRSERAS PGLYLYSPRGLNAS PALAVALSERARGTH 



S EQ_ID_NO_6 RLYSGLNSERGLYGLUASNPHE PRTYRLEUVALALATYRGLNALATHRVALCYSALAARG 

SEQ_ID_NO_17 

SEQ_ID_NO_3 

SEQ ID NO 5 RGLNARGARGGLYARGTHRGLYARGGLYLYS PRGLYI LETYRARG PHEVALALAPRGLYG 



S EQ_I D_NO_6 ALAGLN 

SEQ_ID_NO_17 

SEQ_ID_NO_3 

SEQ ID NO 5 LUARG- 
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APPENDIX B 



Seql is SEQ ID N0:1, Seq2 is SEQ ID NO: 17 

S-w opt: 2614 Z-SCOre: 3193.8 bits: 602.0 E(): 4.3e-176 
Smith- Waterman score: 2614; 100.000% identity (100.000% ungapped) in 412 aa overlap (510-921:1- 
412) 

480 490 500 510 520 530 

Seql EILEPRVALGLASNLEGLTHRTHRMETARGSERPRVALPHETHRASPASNSERSERPRPR 



Seq2 SERPRVALPHETHRASPASNSERSERPRPR 

10 20 30 

540 550 560 570 580 590 

Seql ALAVALPRGLNSERPHEGLNVALALAHISLEHISALAPRTHRGLYSERGLYLYSSERTHR 



Seq2 ALAVALPRGLNSERPHEGLNVALALAHISLEHISALAPRTHRGLYSERGLYLYSSERTHR 

40 50 60 70 80 90 

600 610 620 630 640 650 

Seql LYSVALPRALAALATYRALAALAGLNGLYTYRLYSVALLEVALLEASNPRSERVALALAA 



Seq2 LYSVALPRALAALATYRALAALAGLNGLYTYRLYSVALLEVALLEASNPRSERVALALAA 

100 110 120 130 140 150 

660 670 680 690 700 710 

Seql LATHRLEG L Y PHEG LYALAT YRMET S E RL YS ALAHI SGL YVALAS P PRASN I LEARGTHR 



Seq2 LATHRLEGL YPHEGLYALATYRMETSERLYSALAH I SGL YVALAS P PRASN I LEARGTHR 

160 170 180 190 200 210 

720 730 740 750 760 770 

Seql G L YVALARGTHR I LETHRTHRGL YS E R PR I LETHRT YRS E RTHRT YRGL YL YS PH E LE AL 



Seq2 GL YVALARGTHR I LETHRTHRG L YS ER P R I LETHRT YRS ERTHRT YRGL YLYS PHELEAL 

220 230 240 250 260 270 

780 790 800 810 820 830 . 

Seql AAS PG L YG L YC YS S ERGL YGL YALAT YRAS PILEILEILECYSASPG LC YSH I S S ERTHR 

******** • *••*•*•*•»•••••••••*■• * * * • 

Seq2 AASPGLYGLYCYSSERGLYGLYALATYRASPILEILEILECYSASPGLCYSHISSERTHR 

280 290 300 310 320 330 

840 850 860 870 880 890 

Seql AS P ALATHRS E R I LELEGLYILEGLYTHRVALLEAS PGLNALAG LTHRALAG L YALAARG 



Seq2 AS PALATHRS E R I LELEGLYILEGLYTHRVALLEAS PGLNALAG LTHRALAG L YALAARG 

340 350 360 370 380 390 

900 910 920 930 940 950 

Seql LEVALVALLEALATHRALATHRPRPRGLYSERVALTHRVALPRHISPRASNILEGLGLVA 



Seq2 



LEVALVALLEALATHRALATHR 
400 410 
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APPENDIX C 

>>Seq2 (820 aa) 

S-w Opt: 4989 2-SCOre: 6113.3 bits: 1143.2 E(): 0 
Smith-Waterman score: 4989; 93.103% identity (98.901% ungapped) in 870 aa overlap (496-1364:1- 
820) 

470 480 490 500 510 520 

Seql LYSALAVALASPPHEILEPRVALGLASNLEGLTHRTH- RMETARGSERPRVALPHETHRA 



Seq2 GLYSERHISMET SERPRVALPHETHRA 

10 20 

530 540 550 560 570 580 

Seql SPASNSERSERPRPRALAVALPRGLNSERPHEGLNVALALAHISLEHISALAPRTHRGLY 



Seq2 SPASNSERSERPRPRALAVALPRGLNSERPHEGLNVALALAHISLEHISALAPRTHRGLY 
30 • 40 50 60 70 80 

590 600 610 620 630 640 

Seql SERGLYLYSSERTHRLYSVALPRALAALATYRALAALAGLNGLYTYRLYSVALLEVALLE 



Seq2 SERGLYLYSSERTHRLYSVALPRAIiAALATYRALAALAGLNGLYTYRLYSVALLEVALLE 
90 100 110 120 130 140 

650 660 670 680 690 700 

Seql ASNPRSERVALALAALATHRLEGLYPHEGLYALATYRMETSERLYSALAHISGLYVAIiAS 



Seq2 ASNPRSERVALAIiAALATHRLEGLYPHEGLYALATYRMETSERLYSALAHI SGLYVALAS 
150 160 170 180 190 200 

710 720 730 740 750 760 

Seql PPRASNILEARGTHRGLYVALARGTHRILETHRTHRGLYSERPRILETHRTYRSERTHRT 

***** »••••■ *•**•*•**•**•****•■■••**••*«««***«*« mm • 

Seq2 PPRASNILEARGTHRGLYVALARGTHRILETHRTHRGLYSERPRILETHRTYRSERTHRT 
210 220 230 240 250 260 

770 780 790 800 810 820 

Seql YRG L YL YS PHE LEALAAS PGL YGL YC YS SE RG L YGL YALAT YRAS P I LE I LE I LE C YS AS 

***** •*•*•* *«****«ft*********ftft****ft|p V *c. aaBB ft*******pp* 

Seq2 YRGLYLYSPHELEALAASPGLYGLYCYSSERGLYGLYALATYRASPILEILEILECYSAS 
270 280 290 300 310 320 

830 840 850 860 870 880 

Seql PG LC YS H I S S E RTHRAS P ALATHRS ER I LE LEG L Y I LEGL YTHRVALLEAS PGLNALAGL 

■*•••****•* ••■••**•■»••••»••••*•♦»»■** ■ • • 

***■•••••*•***»•**••••*•**•■*•»*•••••*■*»••■••••*•••••*•**♦* 

Seq2 PGLCYSHI SSERTHRAS PALATHRSERI LELEGLYI LEGLYTHRVALLEAS PGLNALAGL 
330 340 350 360 370 380 

890 900 910 920 930 940 

Seql ' THRALAGLYALAARGLEVALVALLEALATHRALATHRPRPRGLYSERVALTHRVALPRHI 

**•■■••*•**■* ••**ft**»**p****ft 

Seq2 THRALAGLYALAARGLEVALVALLEALATHRALATHRPRPRGLYSERVALTHRVALPRHI 
390 400 410 420 430 440 

950 960 970 980 990 1000 

Seql SPRASNILEGLGLVALALALESERTHRTHRGLYGLILEPRPHETYRGLYLYSALAILEPR 



Seq2 S PRASNI LEGLGLVALALALE S ERTHRTHRGLYGL I LE PRPHETYRGLYLYS ALAI LE PR 
450 460 470 480 490 500 

1010 1020 . 1030 1040 1050 1060 

Seql LEGLVALILELYSGLYGLYARGHISLEILEPHECYSHISSERLYSLYSLYSCYSASPGLL 



Seq2 



LEGLVALILELYSGLYGLYARGHISLEILEPHECYSHISSERLYSLYSLYSCYSASPGLL 
510 520 530 540 550 560 
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1070 1080 1090 1100 1110 1120 

Seql EALAAIJUJYSLEVALALALEGLYILEASNAIAV 



Seq2 EALAALALYS LEVALALALEGLY I LEASNALAVALALATYRTYRARGGLYLEAS PVALSE 
570 580 590 600 610 620 

1130 1140 1150 1160 1170 1180 

S eql RVALILE PRTHRAS NG L YAS PVALVALVALVALALATHRAS PALALEMETTHRGLYPHET 



S eq2 RVALILE PRTHRAS NGLYAS PVALVALVALVALALATHRAS PALALEMETTHRGLYPHET 
630 640 650 660 670 680 

1190 1200 1210 1220 1230 1240 

Seql HRGLYASPPHEASPSERVALILEASPCYSASNTHRCYSVALTHRGLNTHRVALASPPHES 



Seq2 HRGLYASPPHEASPSERVALILEASPCYSASNTHR S 

690 700 710 720 

1250 1260 1270 , 1280 1290 1300 

ERLEASPPRTHRPHETHRILEGLTHRTHRTHRLEPRGLNASPALAVALSERARGTHRGLN 



ER- -ASP GL YLYS PRGLNAS PALAVALSERARGTHRGLN 

730 740 750 760 

1310 1320 1330 1340 1350 1360 

Seql ARGARGGLYARGTHRGLYARGGLYLYS PRGLYI LETYRARG PHEVALALAPRGLYGLARG 



Seq2 ARGARGGLYARGTHRGLYARGGLYLYS PRGLYI LETYRARG PHEVALALAPRGLYGLARG 

770 780 790 800 810 820 

1370 1380 1390 1400 1410 1420 

Seql PRSERGLYMETPHEASPSERSERVALLECYSGLCYSTYRASPALAGLYCYSALATRPTYR 



